AD-A065  519 


UNCLASSIFIED 


AD 
AO  6 


FLORIDA  STATE  UNIV  TALLAHASSEE  DEPT  OF  STATISTICS  F/6  12/1 

ASYMPTOTIC  NORMALITY  AND  EFFICIENCIES  OF  TESTS  BASED  ON  MODIFIE— ETC <U> 
JAN  79  J S RAO*  J SEThURAMAN  DAAG29-76-6-0238 


FSU-STATISTICS-MUB9 


AR0-13888.15-MX 


ASYMPTOTIC  JJORMALITY  AND  EFFICIENCIES  OF  TESTS 
» BASED  ON  MODIFIED  SPACINGS  ♦ ^ 


University  of  California 


Florida  State  University,  Tallahassee 


J . /Sethuraman 


FSU  Technical  Report  No.  M489 
USARO-D  Technical  Report  No.  30  * 


The  Florida  State  University 
Department  of  Statistics 
Tallahassee,  Florida  32306 


N\  V <P9. 


■he  United  States  Army  Research  Office  under 
38*  /The  United  States  Government  is  authorized 

lental  purposes. 


Research 

Grant  No.  (DAAG29-76- 
to  reproducer and^dis 


«5T(«/lMIUSi!in  KJ3 


DISTRIBUTION  STA\ 

u 

uititomu  u 

Approved  for  pub.  : : 

Distribution  Uni: 

D 

ASYMPTOTIC  NORMALITY  AND  EFFICIENCIES 
OF  TESTS  BASED  ON  MODIFIED  SPACINGS 


J « S • Rao 

University  of  California,  Santa  Barbara 
and 

J.  Sethuraman^ 

Florida  State  University,  Tallahassee 

ABSTRACT: 

Tests  based  on  adjusted  or  modified  spacings  are  proposed  for  testing 
goodness  of  fit  problems.  The  weak  convergence  of  the  empirical  distribution 
function  of  such  modified  spacings  is  studied  using  some  earlier  results  of 
the  authors.  The  asymptotic  theory  under  close  alternative  sequences  is  also 
given  thus  enabling  one  to  calculate  the  asymptotic  relative  efficiencies 
of  such  tests. 


Research  supported  in  part  by  the  National  Science  Foundation  under 
Grant  No.  MCS78-01881. 

p 

Research  supported  by  the  United  States  Aimy  Research  Office,  Durham, 
under  Grant  No.  DAAG29-76-G0238. 


AMS  1970  subject  classification  numbers:  Primary  62E20,  62G20,  62G10; 
Secondary  6OFO5.  Key  words  and  phrases:  Modified  spacings,  empirical 
distribution  function,  weak  convergence,  efficiencies  of  tests. 


ASYMPTOTIC  NORMALITY  AND  EFFICIENCIES  OF  TESTS  BASED  ON  MODIFIED  SPACINGS 


J. S.  Rao,  University  of  California,  Santa  Barbara 

and 

J.  Sethuraman,  Florida  State  University,  Tallahassee. 


1.  Introduction  and  Summary. 

Let  X^,Xg,...,Xn  1 be  (n-l)  independently  and  identically  dis- 
tributed random  variables  with  a comnon  distribution  function  (d.f. ) 

G(x).  The  goodness-of-fit  problem  is  to  test  whether  G(x)  is  a 
specified  d.f.  . When  the  latter  d.f.  is  continuous,  a simple  prob- 
ability transformation  on  the  random  variables  would  permit  us  to 
equate  the  pre-assigned  d.f.  to  the  uniform  d.f.  on  [0,1].  Therefore, 
from  now  on  we  assume  that  this  reduction  has  been  effected  and  that 
under  the  hypothesis,  G(x)  is  the  uniform  d.f.  on  [0,1]. 

Let  X£  s *2  S "*  S Xn  1 the  order  statistics.  The  sample 
spacings  (D1,...,Dq)  are  defined  by 

Di  = Xi  " xi_i»  1 = l»***»n 

where  we  put  X/  = 0,  X ' = 1.  In  order  that  this  definition  of  the 
o n 

sample  spacings  be  meaningful  under  any  alternative,  the  d.f.  G(x) 
must  have  the  carrier  [0,1].  (The  carrier  of  a d.f.  is  the  smallest 
closed  set  with  probability  one. ) 

Under  the  null  hypothesis,  E(D^)  = l/n  for  all  i.  We  will  there- 
fore call  fnDj^,  i = l,...,n]  as  'normalised'  spacings.  Further  if 
hnl , . . . ,hfin  be  some  positive  numbers,  then  we  shall  call  (nD^/h^,  i = 1, 
as  'modified'  or  'adjusted'  spacings.  For  example,  one  way  of  adjusting 
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the  spacings  is  to  divide  them  by  their  expectations  under  seme  arbitrary 

d. f.  . The  rationale  behind  dealing  with  the  modified  spacings  is  that 
in  seme  cases  one  may  choose  to  adjust  the  spacings  by  their  expectations 
under  some  alternative  distribution  to  increase  the  efficiency  or  other- 
wise, to  enlarge  the  class  of  statistics  based  on  spacings.  Ibis  may  be 
thought  of  as  being  analogous  to  the  use  of  Normal  scores  and  other  scores 
in  rank  tests  (see  Section  6 for  details).  Tests  based  on  spacings  have 
been  proposed  for  the  goodness-of-fit  problem  by  several  authors.  See 

e. g. , Greenwood  (19^6),  Kimball  (1950),  Sherman  (1950)  and  Darling  (1953)* 
Distribution  theory  of  such  statistics  and  their  asymptotic  efficiencies 
have  been  studied  by  Sethuraman  and  Rao  (1970 ) and  Rao  and  Sethuraman 
(1975).  Since  these  two  papers  are  closely  related  to  the  present  work 
and  are  referred  often,  we  will  refer  to  them  as  SR  and  RS  respectively. 

The  present  paper  is  devoted  to  the  study  of  asymptotic  distributions 
and  asymptotic  relative  efficiencies  (ARE's)  of  tests  based  on  modified 
spacings.  As  pointed  out  in  SR,  for  calculating  Pitman  efficiencies,  it  is 
enough  to  obtain  the  limiting  distribution  under  a sequence  of  alter- 
natives which  converge  to  the  hypothe  s is . Ibis  problem  turns  out  to  be 
somewhat  simpler  as  can  be  seen  from  the  cases  treated  by  Cibisov  (1961) 
and  Weiss  (1965).  Se  also  RS  and  SR.  We,  therefore,  choose  the  following 
sequence  of  alternatives 

G(x)  = x + L(x)/n®,  0 s x < 1 

where  S is  a number  a l/4  and  L(x)  is  twice  continuously  different- 
iable on  [0,1].  These  conditions  imply  our  earlier  requirement  that 
the  carrier  of  G(x)  be  [0,1].  We  shall  say  that  this  alternative  is 
at  a distance  of  order  n”^  from  the  hypothesis. 
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As  in  RS  we  obtain  the  limiting  distributions  of  the  empirical  dis- 
tribution functions  of  the  adjusted  or  modified  spacings  in  the  sense 
of  weak  convergence  of  measures  in  D[0, «]  the  space  of  functions  on 
[0,« ] with  no  discontinuities  of  the  second  kind.  This  is  done  through 
some  interesting  results  of  independent  interest  concerning  the  empirical 
distribution  functions  of  perturbed  random  variables  and  randomly  scaled 
randan  variables  which  are  given  in  Sections  3 and  ^ • Appealing  to  the 
invariance  principle,  we  immediately  have  the  limiting  distributions  of  a 
large  class  of  statistics  which  are  symmetric  in  the  modified  spacings. 

As  shewn  in  RS  and  SR , tests  which  are-  symmetric  in  the  normalised 
spacings  have  limiting  power  greater  than  the  test  size  only  if  b = lA 
i.e. , they  cannot  discriminate  alternatives  which  are  at  a distance  of 
order  n"^  from  the  hypothesis,  for  any  d > l/k.  It  is  also  shown  there 
that  among  the  many  such  standard  tests  due  to  Greenwood  (19*4-6),  Kimball 
(1950),  Sherman  (1950)  and  Darling  (1953),  the  one  by  Greenwood  based  on 

ft 

fj  (nD.  f/n  has  the  wayt  mm  ARE.  We  note  another  interesting  feature 
i=l  1 

of  the  symmetric  spacings  tests,  namely  their  ARE's  do  not  depend  on  the 
particular  choice  of  the  alternative  sequence  i.e.,  L(x). 

These  features  are  not  shared  by  tests  symmetric  in  the  modified 
spacings,  which  are  discussed  in  Section  6.  Here  the  efficiencies  depend 
on  the  alternative  sequence  as  may  be  seen  from  the  expressions  (6.5) 
and  (6.6).  Also  these  tests  based  on  modified  spacings  can  in  general, 
distinguish  alternatives  at  the  more  standard  distance  of  n ^ , though 
there  may  be  exceptions  as  shown  by  an  example  (see  Section  6 for  details). 

The  reader  not  interested  in  the  detailed  derivations  may  skip 
Sections  2,  3 and  k and  go  over  to  Sections  5 and  6 where  the  efficiency 
comparisons  have  been  made. 


2.  Preliminaries 


Let  X^,Xg,. . . »Xn_^  be  (n-l)  independently  and  identically  dis- 
tributed random  variables  with  a continuous  d.f.  G (x),  whose  carrier 
is  ro,l],  n «•  2,3,...  . X^,...,Xn  ^ may  also  depend  on  n,  but  we 
shall  suppress  this  in  our  notations  throughout.  Gn(x)  is  a sequence 
of  alternative  distributions,  which  converges  to  the  uniform  distribution 
on  [0,1],  the  distribution  specified  by  the  hypothesis. 

Assumption  (A); 

We  assume  Gn(x)  to  be  of  the  form 

(2.1)  Gn(x)  = x + L(x)/n3 

for  x € [0,1]  where  5 is  a fixed  constant  * l/4.  We  impose  the 

following  regularity  condition  on  L(x).  L(x)  is  twice  continuously 

differentiable  on  [0,1].  If  t (x)  and  l '(x)  denote  the  first  and 

second  derivatives  respectively  of  L(x),  then  we  note  that  there  is  a 

constant  L such  that 
o 

(2.2)  L(x) | s Lo,  U (x) I * Lq,  |A'(x)|  * Lq 

for  all  x € [0,1]. 

The  inverse  function  of  Gq(x)  is  denoted  by  G~1(p),  0 * p * 1. 

We  define 

(2.3)  kn(p)  = gn[G;1(p)]  = [dG;1(p)/dp]"1. 

It  may  be  verified  that  in  our  case 

(2.4)  G~^(p)  = p-L(p)/n3  + o(l/n3) 

(2.5)  kn(p)  = l + i(p)/nB- L(p)i /(p)/n2a  + o(l/nS+a  ) 
where  o(»)  is  uniform  in  p and  d = max(0,  £-b). 


We  will  obtain  several  limit  distributions  under  the  sequence  of 

alternatives  G (x)  satisfying  assumption  (A).  It  is  clear,  however, 
n 

that  the  limit  distributions  under  the  hypothesis  are  obtained  by  putting 
L(x)  = 0.  We  will  make  some  further  remarks  about  these  alternatives  in 
Section  6.  Let  the  random  variables  (r.v. 's)  X^, . . . be  arranged 

in  increasing  order  of  magnitude  thus 

(2.6)  o * * •••  * * 1. 

The  sample  spacings  have  been  defined  in  Section  1 as 

(2.7)  ®i  = ^i  “ ^ = 

where  we  put  = 0,  X^  = 1. 

We  first  relate  these  sample  spacings  D.  to  the  spacings  based  on 
uniformly  distributed  r.v. 's  on  [0,1]  (to  be  called  uniform  sample 
spacings).  Let  ui»*«.»unl  (n-l)  independently  and  identically 

distributed  r.v. *s  with  a uniform  distribution  on  [0,1].  These  are 
arranged  in  increasing  order  of  magnitude  thus 

0 *jj*  * s U'  s 1. 

1 n-± 

The  uniform  sample  spacings  are  defined  by 

(2.8)  Ti  = Ui  “ Ui-1»  1 = 1»***»n 

where  again  we  put  = 0,  tT  = 1. 

For  two  r.v. 's  X and  Y,  we  write  X~Y  to  mean  that  X and  Y 
are  distributionally  equivalent,  that  is,  the  distributions  of  X and  Y 
are  identical.  We  know  that 

(X^,  i - 0,...,n)  ~ (O^(U'),  i = 0,...,n) 


and  thus 


(D  , i - l,...,n)  ~ (G"X(up  - G’1(U^1),  i = 


(T^k  fa),  i ■ l,...,n) 


where  s s 


(2.9) 


= (Ti/“ni»  1 = 1»***»n^ 


■where 


(2.10) 


with 


1 + e>a  + v*  /=2S  - 


(2.11) 


®ni  * l(6i> 


(2.12) 


V*  - -Ku^t'lUj) 


(2.13) 


sup^  |R*|  -♦  0 almost  everywhere 
i nl 


in  view  of  (2.5).  Also,  from  the  existence  of  the  limiting  distribution  of 
the  Ko imo gorov-Smirnov  statistic, 


(2.14) 


supv^n  |u^-i/n|  = 0p(l). 


Thus  from  the  continuity  of  L,  l and  i/, 


(2.15) 


(2.16) 


* 

sup  n3  Ib*^  - B(i/n)  I = op(l) 


sup  Iy*  -y(i/n)l  = o (l) 
i ni  p 


where 


(2.17) 


B(p)  = £ (p) 
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(2.18) 


Y(P)  - -L(p)i'(p),  0 * p * 1. 


Now,  let  W, , . . . ,W  be  n independently  and  identically  distributed 
i n 

exponential  r.v. ’s  with  density  function  e w,  w a 0.  Ifit  W*  = (W^+***  + Wn) 

and  let  W = W / n.  Then  it  is  well  known  that 
n n 

(Tt,  i = l,...,n)  ~ (Wj/W*,  i»l,...,n). 

Thus  (2.9)  may  be  rewritten  as 


(2.19) 


(DiS  i = 1,« « • ,n)  ~ (W^/ of^i  Wn,  i = 1>...  ,n) 


where 


(2.20) 


(o^p  1 - lj***>n)  ~ (o'r)^ i 1 = l»***»n)* 


In  view  of  (2. 20),  we  save  on  notation  by  writing  ar^  for  and 

retain  its  structure  defined  in  (2.10)  and  will  later  on  utilise  the  prop- 
erties (2.13),  (2.15)  and  (2.16). 


The  empirical  d.f. , Hq(x)  of  the  normalised  spacings  is  defined  as 


follows 


(2.21) 


Hn(x)  = ^ I(nDi;  x)/n,  x * 0 


where 


(2.22) 


I(z;  x)  = 


1 if  z * x 


0 if  z > x 


Using  the  equivalence  (2.19),  we  note  that 


(Hn(x),  x * 0)  ~ { E I(wi/QfiWn;  x^n>  X * 0} 


(2.23) 


= (F*(xWn),  x a 0) 
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(2.24) 


n 

F*(x)  = £ 1(W1/«JL;  x)/n. 
1 


Relation  (2.23)  says  that  the  distributions  of  the  stochastic  processes  (Hn(x) , x t 
and  ( £ I(wi/oni  Wn;  x)/n,  x ^ o)  in  D[0,  ®]  coincides  and  this  distribution- 
al equivalence  is  s"u  jnger  than  the  distributional  equivalence  of  the  finite 
dimensional  marginals.  We  refer  to  Fr(x)  as  the  empirical  d.f.  of 
W^, . . . ,Wn  with  random  perturbations  and  a random  scale  factor  Wq. 

If  <\l’  n = 1,2,...  be  a triangular  array  of  positive 

constants,  define 

(2.25)  = nD^/h^,  i — 1, ...,n. 

* * 

We  shall  call  (D1,...,Dq)  modified  spacings  modified  by  h^, . . . ,hm  and  the 
empirical  d.f.  of  these  modified  spacings  is  defined  by  H*(x)  where 


ally  equivalent  as  follows: 

(2.28)  (a*±t  i = l,...,n)~(hni(l  + 0n*/n3+  y*./n2a  + B^),  i = 

where  0n^,  y*^  and  R^  satisfy  the  conditions  laid  down  in  (2.15),  (2.l£) 
and  (2.13).  As  in  the  remark  after  (2.20),  we  replace  the  symbol  ' ~ ' in 
(2.28)  by  ' = ' in  order  to  avoid  introducing  new  notations. 


iWM* 


3.  Asymptotic  distribution  of  the  empirical  d.f.  of  random  variables 
sub.lect  to  perturbations. 

Let  Z^jZg,...  be  independently  and  identically  distributed  r.v.  's 
with  a common  d.f.  F(x)  with  F(o)  = 0.  We  assume  that  F(x)  is  thrice 
differentiable.  Let  f(x),  f'(x)  and  f" (x)  denote  the  first,  second 
and  thi.  d derivatives  respectively  of  F(x'.  We  impose  the  following  blanket 
condition  (B)  on  F(x). 

Assumption  (B): 

xf (x) , x2f'(x)  and  x^f" (x)  are  bounded  on  [0,®]. 

Let  fa  i = l,...,n:  n = 1,2,...,}  be  a triangular  array  of 
nil 

constants.  Then  the  random  variables 


Zni  ' V“nil’  i = 1»***»n 


are  said  to  be  perturbed  random  variables,  n = 1,2,. 


(3-D 


Fnl(X)  = T,  I^Zni;  X^n 


- t l(  V°nil’*  x)/n' 


We  refer  to  F^Cx)  as  the  empirical  d.f.  of  (Z^, ...,Zn)  under  a 
perturbation  by  the  non-random  quantities  [o^.. -j , i = 1, ...,n}. 

The  following  structure  is  assumed  of  to^-j , i = l,...,n).  There 
exist  continuous  functions  f3(p)  and  y(p)  on  [0,1]  such  that 


(3.2) 


a .,  = 1 + B(i/n)/n3  + v(i/n)/n  9 + Rni 


where  3 is  a constant  a l/k  and 


(3-3) 


sup  |R  . | -*  0 as  n -♦  ®. 
i nl 


It  is  easy  to  see  that  F^(x)  tends  to  F(x)  uniformly  in  x. 

’ 


F(x)  if  a > 1/2 

1 * 

F(x)  + xf (x)  J*  B(p)dp/nd  if  lA  < 9 s 1/2 

O 

F(x)  + xf  (x)  J,*B(p)dp/n1'^ 

+ [xf(x)jS(p)dp  + x2f'(x)jV(p)dp/2]n*  if  S 


after  omitting  terms  which  are  of  smaller  order  than  n"  ' uniformly  in  x, 
The  most  general  conditions  under  which  F^(x)  can  be  replaced  as  above 
must  depend  on  &.  However  since  we  are  contemplating  only  the  application 
with  F(x)  = 1 - exp(-x)  in  Section  6,  we  will  content  ourselves  by 
imposing  the  blanket  condition  (B). 
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The  following  theorem  which  establishes  the  weak  convergence  of  fr^Cx),  x * 0) 
taken  from  RS  , where  the  weak  convergence  on  the  space  D[0,  ®]  is  also 
briefly  explained 

Theorem  3.1  (rsU  Let  condition  (B)  hold.  The  sequence  {^^(x),  x s 0} 
considered  as  a stochastic  process  in  D[0,  ®]  converges  weakly  to  a Gaussian 
process  {^(x),  x * 0)  with  mean  zero  and  covariance  function 

(3.8)  K^Xjy)  = K^(y,x)  = F(x)(l-F(y))  for  x * y.  | 

To  allow  for  perturbations  by  constants  which  are  more  general  than  given 
in  (3*2),  we  consider  a triangular  sequence  * = n = 1,2,... 

with  the  following  structure 

(3-9)  “ni2  = e(i/n)[l  + B(i/n)/n*  + vUAO/n^  + R^] 

where  8(p)  and  Y(p)  are  continuous  functions  on  [0,1]  and  R ^ 
satisfies  (3-3)-  We  put  the  following  condition  (C)  on  9 (p). 

Assumption  (C); 

e(p)  is  continuous  on  [0,lj  except  at  a finite  number  of  points  and, 
for  each  x,  the  integrals  of  F(xe(p)),  0(p)f(x0(p))  and  9 (p)f'(x9(p)) 
as  functions  of  p on  [0,1]  exist  and  are  finite. 

We  shall  see  later  that  this  generalisation  gives  results  which  enable 
us  to  obtain  the  limiting  distributions  of  statistics  based  on  modified 
spacings.  Define  the  empirical  d.f.  (x)  of  the  Z's  perturbed  by  the 
[0^2 ) given  in  (3.9)  , by  a formula  similar  to  (3-l)*  Let  F^,(x) 
r^2(x)  be  as  defined  in  (3*5)  and  (3-6),  with  the  perturbation  constants 
((*ni2 ) instead  of  The  following  theorem  is  proved  exactly  as 

Theorem  3.1  and  is  stated  without  proof. 


Theorem  3.2:  Let  assumptions  (B)  and  (C)  hold.  The  sequence  of 
stochastic  processes  (i-^^x),  x & ^ converges  to  the 

Gaussian  stochastic  process  (rig  (x)»  x a 0}  with  mean  zero  and  covariance 
function  Yi^(xty)  defined  by 

1 

(3.10)  Kg(x,y)  = K^x)  = J*  P(xe(p))[l- F(ye(p))]dp  for  x * y. 


Remark  2;  Under  conditions  (B)  and  (C),  F^(oc),  which  is  defined 

by  (3.5)  through  the  constants  (o^g)  of  (3*9)  can  replaced  by 


/ 


(x)  =■< 


(3.H) 


1 . 

J F(xe(p))dp  if  3 > !/2 

0 

J*  F(x0(p))dp  + r x8(p)0(p)f(xe(p))dp/n3  if  J B * | 

0 0 

f F(x0(p))dp  +J*  x8(p)0(p)f(x0(p))dp/n1^ 

0 0 

+ T f1xY(p)0(p)f(x0(p))dIH-.r1x2B2(p)02(p)f'(xe(p))dp/2]/nl/2 

L’o  0 

if  a = iA 


up  to  terms  of  smaller  order  than 


uniformly  in  x. 


We  now  proceed  to  establish  a limit  theorem  for  the  empirical  d.f.  of 
randomly  perturbed  r.v. *s  . Let  i=l»-«»»n}  n=l,2,...  be  a 

triangular  scheme  of  random  variables  with  the  form 


(3^12) 

where 

(3.13) 


4x  - 1 + B>8  + + < 


hi 


sup//n  I = Pp(l) 


and  there  are  continuous  functions  B(p)  and  y(p) 


on 


[0,1]  such  that 


FT 


JO 


(3-1^) 


where 


(3-15) 


Let 


(3*16) 


sup  n^  iB^-BU/iOl  = o (1) 


sup  IVni-Yd/H)!  = °p(l) 


* 

5 = 


\ - a if  a < | 


if  a*t. 


FIi(x)  = 1 1(zi/(4i;  x)/n* 


u 

1 


This  is  the  empirical  d.f.  of  Z1,...,Zn  perturbed  by  the  random  quantities 
{a*  ,,  i = l,...,n).  Let  the  non-random  quantities  («nil)  be  defined  as 


(3-17) 


= 1 + B(i/n)/na  + y(i/n)/n23  + R 


ni 


in  terms  of  the  new  B(p)  and  v(p)  and  (x)  be  defined  by  the 

relation  (3.5)  with  the  new  c^'s.  We  have  the  following  result  from  RS 
on  the  process 

(3'18)  {r^W  ./T(4(x)  - **o}. 

Theorem  3»3  (RS):  Let  condition  (B)  hold.  The  sequence  of  processes 
(T*^l  xz0)  in  D[0,  ®]  converges  weakly  to  the  Gaussian  process 

(r^U),  x * 0}  with  mean  zero  and  covariance  function  given  by  (3.8). 

Now  suppose  that  (a*12,  i = l,...,n),  n = 1,2...  is  a triangular 
scheme  of  r.v. ’s  with  the  fora 


(3-19) 


°ni2  = 0^i[1  + Bni/na+  Yni/n2S+  Rni] 


where  R^,  and  satisfy  the  assumptions  (3*13)  and  (3.14) 

and  further  there  is  a function  0(p)  on  [0,l]  such  that 

(3.20)  sup /n  |e*  - 0 (i/n) | = o (1) 

i JP  • 

and  0 (p)  satisfies  condition  (C).  Now  define  F*2(x),  and  Tl^fx) 

similar  to  the  expressions  in  (3.16), (3. 5)  and  (3.18),  respectively  with 
(<*^2 ) instead  of  (g*. ).  The  following  theorem  then  follows  frcm  Theorem  3.2 
in  exactly  the  same  way  as  Theorem  3.3  follows  from  Theorem  3.1. 

Theorem  3.4:  Let  assumptions  (B)  and  (C)  hold.  The  sequence  of 
stochastic  processes  (t)^,  M,  x z 0)  converges  weakly  to  the  Gaussian 
process  frj2(x),  x > 0)  in  D[0,  •)  where  [r| 2(x),  x * 0}  has  mean  zero 
and  covariance  function  K^(xty)  given  in  (3.10). 

A note  of  clarification  regarding  the  notation  may  he  in  order.  There 
are  four  different  sets  of  perturbation  constants  in  all,  namely:  of 

Theorem  3.1  satisfying  conditions  (3 .2)  and  (3*3);  {ani2}  of  Tb®01®®  3-2 
satisfying  (3*3)  and  (3.9);  fa*^}  of  Theorem  3.3  satisfying  (3.12)-(3.l4); 

and  finally  {a*i2J  of  Theorem  3.4  satisfying  (3.I3) , (3*l4),  (3*19)  and 

(3.20) .  It  may  he  noted  that  a subscript  2 is  attached  to  denote  perturbation 
constants  with  modifying  term  9 (i/n)  in  them  as  opposed  to  the  use  of 
subscript  1 where  such  a term  is  absent.  Similarly  the  random  perturbation 
constants,  as  well  as  any  quantities  like  empirical  distribution  functions 
based  on  those,  are  starred  (as  opposed  to  the  analogous  unstarred  version  based 
on  non- random  perturbations).  Thus  ^^(x)  is  the  process  based  on  the 

non- randan  fa  which  have  the  modifying  term  0(i/n)  in  them  while 
ni2  . • 

TV*,  (x)  is  based  on  the  random  fa*. . } without  the  0(i/n)  term  in  them, 
nl  nix 

The  four  theorems  of  the  next  section  also  correspond  to  these  four  cases. 
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4.  Asymptotic  distribution  of  the  empirical  d.f.  when  the  random 

•variables  are  subject  to  perturbations  and  a random  scale  factor. 

We  retain  the  notations  of  the  earlier  sections.  Let  ri^M  be 
defined  as  in  (3.6)  through  F^^x)  and  F^(x)  which  are  in  turn  defined 
as  in  (3.1)  and  (3.5)  and  the  bave  the  structure  (3*2). 

Let  ZR  be  a r.v.  and  let  = </n ( - 1 ) • -We  now  make  the  following 
assumption  (D)  on  the  Stochastic  process  x * 0)  and  5^. 

Assumption  (D): 

For  any  finite  collection  (x^, ...» 

{^ni^i),  ri^Cxg),...,  T) nl(x^)>  §n)  converges  weakly  to  the  distribution 
of  {n1(x1),...,n1(xk),  ?),  which  is  multivariate  normal  with  zero  means 
and  covariances  given  by 

(4.1)  cov(fi1(xi),  = K^x^  Xj),  1 * i,  j<k 

where  K1(x,y)  is  as  defined  in  (3.8)  and 

(4.2)  cov(r]1(xi),  %)  = a1(xi),  i = l,...,k 
and 

(4-3)  var(?)  = 1. 

We  add  the  following  to  the  assumption  (B)  made  on  F(x)  in  Section  3. 

/ 

Assumption  (B  ); 

There  is  an  a > 0 such  that  x0f(l-F(x))  -♦  0 and  xf(x)  -♦  0 as 
x -*  ®.  Again  from  RS  we  have 


x^),  the  distribution  of 


■hLJUa 


4 

Theorem  4.1  (RS):  Let  the  assumptions  (b),  (B  ).  and  (D)  hold.  Let 

(4.U)  CajW  = (Fnl(xZn}  " FIl(x))' 

Then 

(4.5)  sup  lr-00-  ‘nnl(x)”Vf(x)l  = Opd)' 

Thus  (x),  x a 0)  converges  weakly  to  the  Gaussian  process 

{^(x)  = ti^x)  + xf(x)£,  x ^ 0)  in  D[0,  ®]  which  has  mean 
zero  and  covariance  function 

(4.6)  K3(x,y)  = K3(y,x) 

= K^y)  + xyf(x)f (y)  + xf(x)ax(y)  + yf (y)ax(x) . 

We  now  extend  this  result  to  the  more  general  non-random  perturbation 

factors  defined  in  (3*8).  I«t  ?a2^*  Fn2^  and  Tin2(x)  be 

* 

as  defined  and  used  in  Hieorem  3*2.  If  be  a r.v. , we  assume  that 
s Jn  (Zq-  1)  satisfies  the  following  assumption  (D  ) with  the  process 

{^(x),  X 2 o). 

/ *\ 

Assumption  (D  ): 

For  any  finite  collection  the  distribution  of 

ft1n2(xl),‘"  ^ converges  weakly  to  that  of  (ti2(x1),...  ,n2(xk),  £} 

which  is  a multivariate  normal  distribution  with  zero  means  and  covariances 
given  by 

(4.7)  cov(n2(xi),  ti2(x.j))  = VV  x^»  1 s i»  J * k 
where  Kg(x,y)  is  as  defined  in  (3. 10)  and 


Then  we  have  the  following  theorem  whose  proof  follows  on  the  lines  of 
the  proof  of  Theorem  4.1  and  is  omitted. 


Theorem  4*2 


I C M “ Tln2^x^  ‘ J*  ®(p)f(x6(p))dP)l 


Thus  {r  g (x) » x a °)  converges  weakly  in  D[0,  ®]  to  the  Gaussian 

1 

process  f^g(x)  = ti2(x)  + 5x(,f  0 (p)f(x6(p))dp),  x * 0}  which  has  mean 

o 

zero  and  covariance  function 


Kg(x,y)  + xy(  J*  e(p)f(x8(p))dp)(.f  0(p)f(y6  (p))dp) 


(y)(  J e(p)f(xe(p))dp)  + ya2(x)(  f e(p)f(ye(p))dp) 


with  K 


Now  coming  to  the  case  of  random  perturbation  factors,  let  fgr|^1 } be 
* + * 

as  in  (3.12)  and  F^x),  F^fr)  8111(1  1,6  as  ijied  and  used  in 

Theorem  3.3  . Let  (or^  },  the  non-random  constants  generated  from 
, * 

be  as  in  (3.17).  Let  Zq,  be  as  used  in  Theorem  4.1  and  satisfy  the 
condition  (D)  with  the  process  x ^ 0).  Then  we  have  the  following 

extension  of  Theorem  4.1  to  the  case  of  random  perturbations  from  RS. 


Theorem  4.3  (RS):  Let  the  conditions  (B),  (B*)  and  (D)  hold.  Let 


(4.12) 


then 

(4.13) 


Cni(x)  - p^W] 


- %!<*>  ' V'W  - V11' 


I 


Thus  ^nl^*  x 2 converges  weakly  to  the  Gaussian  process 
(Ca(x)  = ti2(x)  + §xf(x),  x a o)  defined  in  Theorem  4.1. 

* 

Finally  let  fo^}  be  the  more  general  random  perturbation  factors 

defined  in  (3-19)  and  satisfy  the  conditions  stipulated  there.  Let  fa  ) 

ni2  * 

the  non- random  constants  ’generated’  by  (<x*i2)  be  as  defined  in  (3.9). 

^ Fn2^»  Fn2^x^  and  1Tn2^  be  as  defined  and  used  in  Theorem  3.4. 
* 

***  Zn*  ^n  be  88  U8ed  111  Theorem  4.2  and  satisfy  the  condition  (d*)  with 
* 

the  process  (Tln2(x)>  * a 0).  Then  the  following  theorem  can  be  deduced 
from  Theorem  4.2  in  the  usual  way. 


Theorem  4.4:  let  the  conditions  (b),  (B*),  (C)  and  (D*)  hold.  Let 

ft-3*)  C^W  - yi[P^(iZ*)  - F^WJ. 

Then 

(l,'15)  0iZ.  W**  ' Vn2(x)- Snx(4 Le(p)f(»(p))ap)l  - op(l). 

Thus  the  process  f^n2  (x),  x a 0}  converges  weakly  in  D[0,  ®]  to  the 
Gaussian  process  {c2(x),  x * 0)  defined  in  Theorem  4.2  with  mean  zero 
and  covariance  function  given  by  (4.11). 
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5.  Asymptotic  distributions  of  the  empirical  d.f. 's  of  normalised  and 
modified  s pacings  and  tests  based  on  them. 

In  this  section,  we  relate  the  results  of  the  last  two  sections  to 
the  spacings  statistics.  First  we  give  the  asymptotic  distributions  of 
the  empirical  d.f. 's  of  the  normalised  spacings  Hq(x)  and  of  the  modified 

*N> 

spacings  Hn(x),  defined  in  (2. 21)  and  (2.26)  respectively  using  the  dis- 
tributional equivalences  (2.23)  and  (2.27).  We  then  establish  the  asympto- 
tic normality  of  some  classes  of  test  statistics  based  on  these  spacings. 

We  shall  first  consider  the  empirical  d.f.  of  the  normalised  spacings 
Hq(x),  which  from  (2.23)  is  distributionally  equivalent  to  PjxWj. 

The  r.v. 's  W^,W2, . . . have  the  exponential  d.f. 

(5.1)  F(x)  = 1 - e"x,  x * 0 

which  satisfies  all  the  regularity  conditions  of  Theorem  4.1  and  the  as- 

* 

sumptions  (B)  and  (B  ).  Further  the  (o^  ) used  in  the  definition  of 
F^(x)  satisfy  the  conditions  (3.12),  (3*13)  and  (3*14)  with  8(p)  and 
y(p)  given  by  (2.17)  and  (2.l8).  Hence  we  have 

1 1 

(5.2)  J 0(p)dp  = J i(p)dp  = 0 

o o 

J y(p)dp  = - X l(p)A'(p)dP  - X A2(p)dp  = X P2(p)dp. 

o o o o 

Let 

(5.3)  c*  oo  * /n  [Hnoo  - iJjOOl 

nl 

~ y:  tf^(x5n)  - 4001 


where 


(1- e"x)  + (j  jt2(p)dp)e"x(x-x2/2)/^/n  for  d = lA 
' o 


ignoring  terms  which  are  of  smaller  order  than  n / uniformly  in  x. 

Further  since  the  random  scale  factor  here  is  Wn,  assumption  (D)  is  sat- 
isfied and  a^x),  defined  in  (k.2)  is  easily  seen  to  be  (-xe”x).  In  view 
of  these  remarks  we  have  the  following  theorem  as  a consequence  of  Theorem 
k.3  as  in  RS. 

* 

Theorem  5«1  (RS):  The  sequence  of  stochastic  processes  {^^(x),  x * 0) 
converges  weakly  to  a Gaussian  process  (^(x),  x > 0}  with  mean  zero  and 
covariance  function 

(5-5)  K3(x,y)  = e"y(l-  e'X  - xye"x),  x * y. 

The  invaviance  principle  may  be  invoked  to  obtain  the  limit  distributions  of 
various  functionals  of  (x)  and  their  ARE' s computed,  as  was  done  in  SR. 

Consider  now  the  modified  spacings 
(5.6)  D*  = nDi/h^ 

where  h . satisfy 


(5-7)  sup  /n  | h . - h(i/n)|  = o(l) 

i n 

where  h(p)  is  a function  on  [0,1]  having  at  most  a finite  number  of 
discontinuities.  Then  the  empirical  d.f.  [H  (x),  x * 0)  of  [D-,...,D  } 


defined  in  (2.26)  is  distributi anally  equivalent  to  ( F*2 (xW^),  x a 0} 

* 

where  Fn2(x)  is  the  empirical  d.f.  of  the  exponentially  distributed  r.v. 's 

* 

W^,...,Wn  perturbed  by  the  random  factors  i = l,...,n)  which  have 

the  structure  defined  in  (3. 19),  i.e. , 


a. 


ni2  ni 

* 


e«<(1+  Bni/n&  + Yni/n  ^ + Rni) 


where  9ni, 

Bni»  vni  and  Rni  satisfy  (3*20) 

respectively  with 

6(p)  = h(p) 

(5.8) 

0(p)  = v(p) 

y(p)  = -L(p)jt/(p),  osps 

Let 

(5-9) 

= V^CH*(x)-  F^(x)] 

where 

F^(x)  = J (l-e'^^dp  if  ft  > 1 

= J*  (1- e-xh^)dp  + (j  xe’^^j^pMpJdp)/^ 

o v o 

(5.10)  If  1A  <5-  1/2 

= .f  (1-  e-xh^p^)dp  + ( f xe_xh ^ i(p )h (p )dp^/ n1^ 
o o 

+ .r  r-xL(p)£,(p)h(p)e"3dl^p^  - x2h2(p)jt2(p)e"xh^PV2ldp/n2 

oL  J 

if  ft  = lA 

to  terms  of  order  n"  ^2  1 As  an  inmediate  consequence  of  Theorem  4.4, 
since  a2(x)  defined  in  (4.8)  is  ^-x  J h(p)e_xh^p^dp^,  we  have  the  following 


J J 


(5.13) 


V^n[TQ-  J*  m'(x)(l-  F^2(x))dx  + m(o)] 


where  F*2(x)  is  defined  in  (5-10),  converges  weakly  to  the  normal  dis- 


tribution with  mean  zero  and  variance 


(5-14) 


00  oo 


J J Vx,y)mW(y)axdy 


o o 


where  K^(x,y)  is  as  defined  in  (5.H). 

This  theorem  covers  a very  wide  range  of  statistics  based  symmetrically 
on  modified  spacings. 


i 
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6.  Asymptotic  relative  efficiences  of  tests  based  on  modified  spacings. 

The  Pitman  asymptotic  relative  efficiency  (ARE)  of  a test  relative  to 
another  test  is  defined  as  the  limit  of  the  inverse  ratio  of  sample  sizes 
required  to  obtain  the  same  limiting  power  at  a sequence  of  alternatives 
converging  to  the  null  hypothesis.  This  limiting  power  should  be  a value 
in  between  the  limiting  size,  a and  the  maximum  power  1,  in  order  that 
it  can  give  an  insight  into  the  power  behaviour  of  the  test.  If  the  limiting 
power  of  a test  at  a sequence  of  alternatives  is  at  then  its  ARE  with 
respect  to  any  other  test  whose  limiting  power  (with  same  size)  is  greater 
than  a,  is  zero.  On  the  other  hand,  if  the  limiting  power  of  a test  at 
a sequence  of  alternatives  converges  to  a number  in  the  interval  (o',  l), 
then  a measure  of  the  rate  of  this  convergence,  called  ’efficacy’  can  be 
computed.  Under  certain  standard  regularity  assumptions  (see,  e.g.,  Fraser 
(1957))  which  include  a condition  about  the  nature  of  alternatives,  asymptotic 
normal  distribution  of  the  test  statistic  under  these  alternatives,  etc., 
this  ’efficacy’  is  given  by 

It.  , k 

(6.1)  efficacy  = . 


Here 


and 


2 

o 


are  the  mean  and  variance  of  the  limiting  normal  dis- 


tribution under  the  sequence  of  alternatives  when. the  test-statistic  has 


been  normalised  to  have  a limiting  normal  distribution  with  mean  zero  and 
finite  variance  under  the  hypothesis.  In  such  a situation,  the  ARE  of  one 
test  with  respect  to  another  is  simply  the  ratio  of  their  efficacies. 

Using  Theorem  5*5>  we  can  now  compute  the  ARE’s  of  tests  which  are 
symnetric  in  modified  spacings.  We  defined  - nD^/h^,  i = l,...,nj- 

as  the  modified  spacings  where  the  factors  (h^  ] satisfy  the  condition 


(6.2) 


sup(/n  | h . - h(i/n) I = o(l). 
i “■ 

If  m be  ary  function  on  [0,  ®]  satisfying  the  conditions  of  Theorem  5*5 
we  define  a symmetric  statistic  based  on  the  modified  spacings 

n 

(6.3)  T*  = £ m(D*i)/n 

i=l 

* 

The  mean  under  the  hypothesis  of  this  Tq  is,  say 

(6.4)  n*  = i*  / dxdp 

o o 

and  under  the  alternatives  (2.1)  say 

n,  = u if  3 > l/2 

pln  pon  ' 

= + A(m,  L,  h)/n^,  say  if  l/4  < 3 * l/2 

(6.5)  = n*n  + A(m,  L,  hj/n1/4  + B(m,  L,  hj/n1/2,  say  if  3 = lA 
where 

A(m,  L,  h)  = - J J*  m/(x)xA(p)h(p)e"ril^P^dxdp 
o o 

B(m,  L,  h)  = J*  J*  m/(x)e-xh^p^[xL(p)je,(p)h(p) 
o o 

(6.6)  + (x2/2)/(p)h2(p)]dxdp. 

, * 

If  A(m,  L,  h)  0,  then  the  sequence  of  tests  based  on  Tr  can  distinguish 

-I/2 

alternatives  of  the  form  (2.1)  at  a distance  of  order  n ' from  the 
hypothesis.  This  shows  that  such  tests  have  a better  performance  than  tests 
considered  earlier  which  are  symmetric  in  the  normalised  spacings.  However 
there  is  no  surety  that  A(m,  L,  h)  -f  0 for  all  L.  Consider  the  following 
example.  Let 
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hni  = n/(»"  i + 1) 
h(p)  * l/(l-p),  0 * p < 1 

D*  = (n-i  + l)^ 

m(x)  = x. 

n 

* r-  , *.  , 

Tn  = 

1 

n 

= ^ [ (n-  i + ljD^/n 
i=l 
n 

= X Vn  + 1/n- 

i=l 


A simple  computation  shews 

1 

(6.9)  A(m,  L,  h)  = J*  pjfc  (p)dp 

o 

which  is  times  the  excess  of  the  mean  under  the  alternative  over  that 

* 

under  the  hypothesis  and  is  zero  for  alternatives  under  which  T has  a 

n 

n * 

mean  1/2.  But  if  this  excess  is  non-zero,  the  test  based  on  7;  m(nD^)/n 

has  a better  performance  than  symmetric  normalised  spacings  statistics  con- 
sidered in  Theorem  5-3-  However  if  A(m,  L,.h)  = 0,  this  test  statistic  T* 

discriminates  such  alternatives,  if  at  all,  only  when  they  are  at  a distance  of 

_ 1 

n u,  which  puts  this  on  par  with  the  symmetric  spacings  tests. 

But  it  should  be  remarked  that  one  can  always  construct  tests  based 
symmetrically  on  modified  spacings  which  have  the  ability  to  detect  alter- 
natives at  a distance  of  n~  ^ . This  is  because  in  testing  the  hypothesis 
of  uniformity  against  the  fixed  sequence  of  alternatives  G^(x)  = x + L(x)/n^, 


the  test  statistic 


n 

(6.10)  T = Y i(i/n+l)D 

i=l 

1 2 

with  h(p)  = l//(p)  and  m(x)  = x,  has  A(m,L,h)  = J l (p)dp  ^ 0.  Sane 

0 

recent  investigations  hy  Holst  and  Rao  (1978)  indicate  that  tests  of  the  form 
(6.10)  provide  the  locally  most  powerful  spacings  tests  for  uniformity  against 
the  fixed  sequence  of  alternatives  Gq(x). 
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